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Method of Characterization of Biological Entities 



FIELD OF THE INVENTION 
The field of the invention is the field of infrared spectra of biological entities such as single 
exfoliated cells from normal and abnormal patients. Differences in such spectra can be used to 
detect cancer in samples of cells and tissues, and can be used as a screening test. 

6 

RELATED PATENTS AND APPLICATIONS 
This application claims priority from a provisional application 60/33,529 filed 1/22/99 
entitled "A system and method to determine the absence or presence of cancerous disease by 
infrared spectroscopy", by Diem et al. 



1 1 BACKGROUND OF THE INVENTION 

Previous papers and patents claimed to be able to detect the differences between normal 
and abnormal (pre-cancerous and cancerous) cells and tissue by inspection of the infrared spectra 
of these cells and tissues. Although some of these patent applications and scientific reports present 
al least partially valid data, the interpretation of these data mostly lacks the specific understanding 

16 of the origin of spectral differences between normal and abnormal cells and tissues. 

Early patent applications and scientific reports by Wong and coworkers were based on 
faulty interpretation of spectral differences in cervical and other cells and tissues. These studies 
failed to take into account the specfral changes in cells and tissues associated with maturation and 
differentiation of cells. Since certain cancerous and pre-cancerous diseases are accompanied by 

2 1 disruptions of the regular maturation of cells in tissues, some weak correlation between cancerous 
disease and spectral features was observed. The inconsistencies of the correlations were blamed 
on failures of standard methods of cytology and pathology. 

Although some of the shortcomings of the earlier patents had been established, US Patent 
# 5,733,739 which amplifies the misinterpretations of earlier reports and patents, and uses data that 

26 are clearly misinterpreted, has issued. For example, the patent used infrared (IR) spectral data from 
extracellular materials, such as mucus, and other confounding factors such as blood cells, for the 
interpretation of the spectral characteristics of cervical cells. Thus, most data used in their patent 
are unrelated to actual cancerous and pre-cancerous disease but rather to gross spectral changes due 
to contamination of cervical cells. The actual specfral changes due to cancerous disease, to be 



1 discussed below, cannot be detected by the crude methods described in patent No. 5,733,739. 

US Patent 5,596,992 uses infrared spectroscopy to distinguish normal from cancerous 
leukocytes and other cells by multivariate statistical methods. These studies use highly 
homogeneous samples and, therefore, have a much higher success in predicting disease from 
infrared data. However, they have failed to realize a source of spectral heterogeneity that confounds 
6 the interpretation of the data, and is due to the stages of cells' reproductive cycle. 

We have established that identical and highly pure cells still present specfral heterogeneity 
due to the differences in their development. Only when cells are separated into homogeneous 
fractions accordmg to their stage in the cell cycle will homogeneous specfral patterns be observed. 
Under these circumstances, single cells in one given stage exhibit spectral characteristics that can 
1 1 be directly related to the presence of cancer. Thus, the understanding of the cellular biology 
underlying the cell's reproductive cycle is necessary for a reliable diagnosis of disease. A method 
will be reported here that allows the detection of single cells that carry the signature of cancerous 
disease. 



OBJECTS OF THE INVENTION 
16 It is an object of the invention to provide an apparatus and a method for determining 

characteristics of large numbers of biological entities such as cells. 

It is an object of the invention to distinguish normal from abnormal cell populations by 
statistical analysis of characteristics of a large number of single cells or other entities. 

It is an object of the invention to provide an apparatus for measuring the infrared vibrational 
2 1 spectrum of large numbers of single cells or other entities. 

It is an object of the invention to provide an apparatus which measures the infrared 
vibrational spectrum of such a large number of cells or entities that meaningfiil statistical analysis 
is possible, in a in a time short enough that the process may be carried out at low cost. 

SUMMARY OF THE INVENTION 
26 Apparatus and a method for using the apparatus to determine the infrared vibrational 

spectral absorption of a large number of individual cells or other biological entities is disclosed. 
The infrared vibrational spectra characterizing the presence of DNA in the cells is used to 
determine the statistical proportion of the cells in a non quiescent state, the that proportion is used 
to determine if the cells represent a cell population having cancerous or other anomalous 
3 1 characteristics. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 1 depicts schematically the stages of the division cycles somatic cells undergo. In 
cancerous cell cultures, cells often cycle directly from one division cycle into the next; i.e., they 
are dividing constantly. The width of each slice shown is approximately proportional to the 
percentage of time the cells spend in a given stage. 
6 Figure 2A shows a typical infrared absorption spectrum of a protein film. The peaks are 

referred to by standard nomenclature of vibrations giving rise to the absorptions. Figure 2B and 
Fig. 2C show the infrared absorption spectra of DNA and RNA respectively. The horizontal axis 
(abscissa) describes the wavelength of the infrared radiation, expressed in units of inverse length. 
The ordinate denotes the amount of light absorbed by a vibration (absorbance) and is presented in 

1 1 arbitrary units. Single cells typically exhibit between 0.05 and 0.2 absorbance units. 

Figure 3 shows the infrared traces of DNA and RNA, each superimposed on a protein 
background spectrum. Protein and nucleic acid intensities are adjusted such that their intensities 
correspond approximately to the intensity ratios observed in cells and tissues. Note that the 
resulting spectral composites can be easily distinguished; thus, the spectral patterns in Figure 3 can 

16 be used to judge whether DNA or RNA constitutes most of the observable nucleic acid in a cell 
or tissue. 

Figure 4 shows the different spectral fraces observed for cultured cells separated into 
different phases of their division cycle by elutriation. Trace A shows the averaged spectrum 
obtained for an exponentially growing cell culture, that is, a mixture of all cell division phases. 

2 1 Trace B shows a spectrum for cells in the Gl phase, trace C in the early S phase, frace D in the late 
S phase and trace E in the G2 phase. Note the differences in the amount of DNA observable 
between Gl and G2 phase on one hand, and the S phase on the other. 

Figure 5 shows a comparison between normal and abnormal squamous cells, clustered by 
cell cycle phase, hi Figure 5A, the traces of normal and abnormal cells are virtually 

26 indistinguishable, indicating that some cells in a sample of abnormal cells maintain normal spectral 
properties. We attribute these spectral patterns to be associated with inactive (non-dividing) cells 
of the GO phase. The spectral traces in Figure 5B are believed to be due to the Gl phase. The 
traces due to abnormal cells agree well with those of the Gl trace observed in Figure 4, trace B, 
for cancerous ML-1 cells. We believe that the "normal" trace in Figure 5B differs from the 

31 abnormal ones by less DNA spectral contributions. This view is justified by the strong DNA 
features at 1230 cm"', and the small DNA shoulder at 970 cm"' in the "abnormal" spectra. 
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1 Figure 6 shows a microscopic image taken of Eu^"" stained cells through a fluorescence 

microscope. The arrow indicates an approximate scale of the display. The Figure demonstrates that 
excellent separation of cells is easily achieved, and that cells can be localized by their fluorescent 
emission. This localization of cells is used to speed up data manipulation for the infrared imaging, 
and may serve as indications whether or not single cells are in the field of view of a given infi-ared 
6 detector element. 

Figure 7 shows a sketch of the most preferred embodiment of the invention. 
Figure 8 shows a sketch of a preferred embodiment of the invention. 
Figure 9 shows a sketch of a preferred embodiment of the invention. 

1 1 DETAILED DESCRIPTION OF THE INVENTION 

SCIENTIFIC ASPECTS OF THE INVENTION 
a) Aspects of Cellular Biology 

Mammalian and other cells undergo division according to a cell cycle scheme depicted in 
Figure 1. Normal cells are found predominantly in a state, referred to as GO, in which no 

16 reproduction occurs. The division process is initiated by certain biochemical signals, upon which 
a number of predetermined processes occur that causes a cell to duplicate itself This duplication 
process may take 20 to 30 hours, and can be divided into phases Gap 1 (Gl), Synthesis (S), Gap 
2 (G20 and Mitosis (M). In these phases, well-established processes take place: for example, in the 
S phases, the DNA strands containing the genetic blueprint of the cell are duplicated, whereas in 

21 the M phase, the actual cell division takes place; i.e., two new cells are created from the progenitor 
cell. 

Cancerous cells can be cultured indefinitely if the cells are kept at proper conditions for cell 
growth. In such cell cultures, the cells re-enter the next division cycle as soon as the previous one 
is completed, and the number of cells doubles after a period of time corresponding in length to the 
26 cell division cycle. Such a cell culture is said to be exponentially growing, and the cells found in 
a given phase of the cycle, i.e. Gl, S, G2 and M, is determined by the relative length of these 
stages, which is about 10 hours for the S phase, 8 hours for Gl, 6 hours for G2, and minutes for 
M. 

In order to obtain cells at given stages of their cycles, cultured cells can be separated into 
31 tractions of good phase homogeneity by a density/size centrifugation called elutriation. 
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1 Subsequently, fluorescence activated cell sorting (FACS) technology can be used to identify these 
fractions as Gl, S or G2 phases by the DNA content. These methods present the opportunity to 
interpret the changes in infrared spectra of cells at various stages of their cell cycles. 

b) Aspects of Vibrational Spectroscopy 

In infrared spectroscopy, the attenuation of the intensity of a beam of infrared light upon 
6 passing through a sample is measured. This attenuation is caused by the interaction of the light with 
the vibrational transitions of the sample molecules. These absorptions of infrared light, when 
plotted against the wavelength of the light, produce a unique fingerprint pattern of the molecules 
encountered by the beam of light. The fingerprint pattern is very useful in identifying entities of 
biological interest, which include but are not limited to cells, proteins, viruses, fluids, etc. Such 

1 1 fingerprint patterns for a number of cellular components are shovm later in this specification. In 
addition, one can assess the degree of packing of certain cellular components from the infrared 
spectral patterns: i.e., we have demonstrated before that a nucleus of a quiescent cell is optically 
so dense that it may not transmit any of the incident infrared radiation, and hence no infrared 
spectral features due to the cell nucleus are measurable. Consequently, infrared spectroscopy and 

16 the appearance of spectral feature due to the nucleus may be used to monitor nuclear processes 
which result in significant changes in the packing of the constituent molecules. 

In infixed microspectroscopy (also referred to as infrared microscopy) the infrared beam 
is passed through the specimen and focused by an infrared microscope that allows infrared spectral 
data to be collected from microscopic particles, such as single cells, or pixels of tissue the size of 

21 a cell. 

The variety of molecules found in a human cell is so staggering that the unambiguous 
assignment of the infrared spectra of a cell' s constituents is not possible, particularly since different 
proteins, in general, have similar vibrational spectra. However, we and others have shown in the 
past that changes in molecular composition can be observed and interpreted reliably. For example, 
26 different protein/nucleic acid ratios, or the overexpression of structural proteins, can be monitored 
via infrared microspectroscopy. 

c) Aspects of hifrared Spectra of Cells and Tissue 

The following section presents a detailed view of recent progress in understanding the 
infrared specfroscopy of cells and tissues. Such a discussion, and the detailed understanding, has 
31 been absent in many of the previous publications and patent applications. Consequently, these 
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1 earlier efforts were based on intuition, rather than scientific results, and reached unfounded claims 
and conclusions. 

We have found that the mfrared spectra of cells and tissues are dominated by the infrared 
spectral patterns of proteins (as shown m Fig. 2 trace A), which constitute the predominant cellular 
component by mass. Although there are hundreds or thousands of different proteins m a cell, the 
6 protein spectrum observed is determined by the most abundant proteins which generally are 
structural protems such as tubuline that determine a cell's overall shape and physical properties. 

Protein infrared absorption spectra are known to vary with the secondary structure of the 
protein (a-helical, pleated sheet, random, etc.), the protein's state of hydration, the solvent's ionic 
strength, etc. However, the averaged infrared spectra of all metabolic and structural proteins found 

1 1 in cells turn out the be remarkably the same for most cells. The only protems that exhibit distinctly 
different spectra are found in connective tissue (e.g., collagen). 

The averaged protein spectra found inside cells are dominated by the amide I band at ca. 
1650 cm-', the amide II vibration at 1530 cm"', the amide III peak at 1245 cm ', and a number of 
side chain vibrations in the 1310, 1390 and 1450 cm"' range. 

1 6 The infrared absorption spectra of RNA and DNA are shown in Figure 2, Trace B and C, 

respectively. These spectra exhibit absorption peaks between 1580 and 1700 cm"' due to the 
aromatic base breathing and C=0 stretching vibrations. The ionized P02' and ribose groups exhibit 
a triad of peaks that occur in DNA at 1071, 1084 and 1095 cm ' with nearly equal intensities. 
Further DNA peaks are observed at 965 and 1245 cm"' (the phosphodiester vibration). In RNA, 

2 1 the peak at 1 085 cm ' is stronger than the two other peaks in this triad, and forms a distinct "nose". 
Furthermore, the intensity ratio of this triad to the phosphodiester peak at 1 245 cm'' is about 1:0.7, 
whereas it is about unity in DNA. Since nucleic acid vibrational spectra in cells in tissues are 
always observed in the presence of protein, we show in Figure 3 expanded regions of the DNA and 
RNA spectra superimposed on protein spectra. 

26 In cells, DNA is found in the nucleus and in mitochondria. RNA, on the other hand, can 

occur in the cytoplasm as ribosomal or transfer RNA (r-RNA and t-RNA), and in the nucleus and 
cytoplasm as messenger RNA (m-RNA). Thus, the cytoplasm is relatively rich in various RNA 
species, whereas the nucleus contains nearly all the DNA. 

Terminally differentiated, and no longer proliferative cells exhibits virtually no DNA 

31 spectral features. Consequently, in these cells mainly the cytoplasmic RNA spectral features are 
observed superimposed on the protein spectrum. In such cells, the nucleus is very small and very 
well delimited. The concentration of DNA, RNA and protein in such a nucleus is quite high, and 



1 leads to an optical density of the nucleus in excess of 1 absorbance unit. However, DNA is not 
distributed uniformly throughout the nucleus; rather, it is wrapped tightly around proteins known 
as histones. Thus, the local concentration of DNA and protein is even higher, and it is likely that 
the chromosomes of an inactive nucleus will appear as "black" (i.e., opaque or nearly opaque) 
strings. Little or no spectral information from the DNA can be collected in this case. 
6 Benedetti et al. presented spectral results that confrnn this hypothesis. They have observed 

cells in which there were 64 copies of nuclear DNA whereas in normal cells, there are two copies 
of DNA . The cells with higher DNA content do not exhibit DNA spectral features stronger than 
those observed in cells with two copies . Interestingly, with the onset of pre-cancerous and 
cancerous disease, the DNA signals become generally more pronounced. This aspect, and models 
1 1 to explain h, will be discussed later in this application. 

Aside from protein and nucleic acids, carbohydrates (in the form of polymeric sugars or 
glycoproteins), phospholipids, water, and a few other compounds may occur in cells and tissues 
at levels that make them detectable via infrared spectroscopy. 

d) Aspects of Infrared Spectra at various Cell Cycle Stages 

1 6 Even cells of very high homogeneity and purity show significant spectral heterogeneity. 

This heterogeneity is attributed to the fact that cells may by found at different stages of the 
reproductive cycle. Healthy and frilly mature cells may not divide at all, whereas cells of the 
proliferative layer of epithelium may undergo slow division. In cancer cells, on the other hand, the 
division cycle may be a continuous process that leads to a rapid growth of the cancer. Cancerous 

21 cells separated according to their reproductive cycle stages {i.e., Gl, S, G2 or M) show spectral 
patterns (Figure 4) that are distinctly different from the patterns observed in normal samples. 
Interpretation of these patterns is of prime importance for the understanding of different spectral 
patterns observed for normal and diseased samples of tissues and cells. As such, the next section 
is the primary piece of intellectual property to be protected by this disclosure. 

26 Figure 4, trace A shows infrared absorption spectra collected from an exponentially 

growing cell culture of myeloid leukemia (ML-1) cells, and cells fractionated into the cell cycle 
phases by elutriation. Identification of the cells in a given fraction was performed by fluorescence 
activated cell sorting (FACS) analysis. 

The S phase spectra in Figure 4 (trace C) resembles that of exponentially growing cells in 

3 1 that it exhibits sfrong vibrations due to DNA. The spectra of cells in the Gl and G2 phases are 
similar to each other, and very different from the S phase spectrum in the low frequency region. 
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1 The shape of the peaks around 1070-1 100 cm"' (the "nose at 1085 cm"') suggests that in Gl and 
G2 phases, mostly RNA is observed. The results for cells in the 01 and G2 phases confirmed the 
hypothesis of opaque DNA: Since cells in the Gl phase are diploid (2 copies of DNA), whereas 
cells in the G2 phase are quadruploid (4 copies of DNA), one would observe stronger DNA signals 
if the nuclear DNA was detectable. However, since cells in the Gl and G2 phases exhibit very 

6 similar spectral features that bear the signatures of RNA, it follows that the infrared spectra do not 
detect the DNA in the nucleus, but rather, the cytoplasmic RNA. The strong contributions of DNA 
in the S phase (Trace C and D), however, are due to the DNA transcription which requires that 
sections of DNA are unwound from the chromatin, and thus, may become partially transparent to 
and detectable by IR radiation. 

1 1 e) Aspects of Spectral Differences between Normal and Abnormal Samples 

When examining healthy and diseased single squamous cells, one observes a mixture of 
different cells of different stages of maturation. Consequently, one observes a large variety of 
different spectral traces from these cells. In order to detect spectral changes due to disease, it is 
advantageous to ignore spectra due to cells that differ by states of maturation or differentiation. 

16 This separation can be accomplished by data analysis (vide infra); i.e., there is no need to 
physically separate the cells in the sample. 

After all data resulting from different stages of maturity are eliminated from the analysis, 
one can arrive at a set of spectral traces of cells that are still proliferative. These are the most 
important for analysis since their progeny cells will carry the same diseased genes. Even among 

21 these cells, one detects spectral inhomogeneity that may be attributed to the different phases of the 
cell cycle. For squamous tissue, for example, the spectral patterns of the immature cells differ 
significantly between normal and abnormal states of health in some of the observable division 
phases. However, prior to interpreting these differences in terms of presence or absence of disease, 
the heterogeneity of the spectral patterns due to the cell's reproductive cycle must be established. 

26 The differences between cancerous and non-cancerous cells can be observed most easily 

in selected cell cycle phases, whereas other phases exhibit virtually indistinguishable traces. Figure 
5 A, for example, shows a comparison between healthy and diseased single cell spectra that are 
virtually superimposable. The low nucleic acid / protein ratio of these spectra suggests that the cells 
are not dividing actively (GO or Gl stages). However, reference spectra for the pure GO phase have 

3 1 not been observed in the ML- 1 cells reported above, since they cycle continuously and never reach 
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1 the inactive phase GO. In normal samples most cells should be found in GO, and even in cancerous 
cells, the majority of all cells should be in GO. 

Figure 5 B shows the differences between normal and abnormal cells in a phase whose 
spectra resemble those observed for the middle of the S phase. The spectra differ enormously in 
the intensity of the peak at 1230 cm"' to the peak at 1300 cm"'. The former of these contains 
6 protein and DNA, whereas the latter is nearly a pure protein peak. Thus, any increase of the 
1230/1300 cm-' peak intensity ratio is an indication of the visibility of the DNA. This is further 
confirmed by the appearance of a sharp DNA peak at 970 cm ' in the cancerous samples. 
Comparing individual spectra from normal and abnormal samples, spectral pairs similar to the ones 
depicted in Figures 5A and 5B can be found. Each spectral pair is thought to arise from a given 
1 1 phase in the cell cycle, and the spectra always differ in the amount of spectral contribution of DNA. 
This is an indication that normal and cancerous single cells in a number of different stages of their 
reproductive cycle, can be distinguished by their detectable DNA content. 

The mtensity ratio of the peaks at 1230/1300 cm' , however, is not a uniformly applicable 
indicator of disease. We found that any cells actively involved in replication or transcription has 
16 a higher visible DNA content than inactive cells. Thus, cell cycle dependent data of cells from 
various organs need to be collected to create a baseline of how much DNA contribution constitute 
an abnormality. 

The following section describes how the information presented above can be used to screen 
exfoliated or biopsied cells, tissues or other biological entities for the occurrence of cancer, 

2 1 precancerous aspects, or other biological or physical abnormalities. In particular, the logical steps 
required to proceed from sample collection to a valid and reliable diagnosis for the most preferred 
embodiment of the invention will be discussed. 

Epithelial cell samples can be derived by scraping the surface of the epithelium with 
suitable devices such as brushes, spatulas, etc., used presently to collect specimens. Samples from 

26 internal organs can be obtained by thin needle aspiration, needle biopsies or surgical biopsies. 
Cellular components of body fluids (lymphocytes and leukocytes) can be isolated directly from 
these body fluids. Standard methods, such as digestion with collagenase to break up tissues into 
individual cells, is utilized to obtain single cells suitable for spectroscopic analysis. Other 
biological specimens such as protems and fragments of DNA, RNA, or other molecules may be 

3 1 obtained by methods very well known in the art. 

Cells and other biological entities obtained in this fashion are expected to be quite 
heterogeneous: exfoliated cells, for example, may contain cells at different stages of maturation. 
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1 whereas lymphocytes and leukocytes may be found at different stages of maturation and 
differentiation. Cells derived from tissue sections may contain endothelial cells (for example, from 
blood vessels). This heterogeneity is expected and will not present a significant problem if the 
spectral data are collected on a cell-by-cell basis, rather than an averaged spectral collection. An 
earlier patent (e.g., Zakim and Lord, US Patent # 5,733,739) reports the use of cell pellets of 

6 unknown and variable composition to carry out cancer screening; however, the similarity of 
spectra of all cells and tissues render such an approach difficult or impossible to implement with 
any reliability. 

The cells or other entities obtained as described above are treated to remove impurities due 
to blood, lymph, mucus or other confounding constituents. Simple separation procedures are 
1 1 useful to enhance the percentage of desirable entities for the investigation. 

The experiments described above reveal that most pronounced differences between normal 
and abnormal cells are observed when cells replicate their DNA. In fully mature and terminally 
differentiated cells, this process no longer occurs; thus, it is advantageous to remove them or 
reduce their numbers. 

16 The collected and purified cells are fixed to prevent degradation of the samples. Flash 

fixing by ethanol (15 sec, low temperature) produces samples of sufficient stability for 
spectroscopic analysis. Longer exposure to ethanol may dissolve the cell membranes, may lead to 
cell fiision and precipitation of cellular components. Subsequently, cells are transferred to one of 
a number of different infrared transparent substrates for spectral analysis for the most preferred 

2 1 embodiment of the invention. 

The spectroscopic sample for the most preferred embodiment consists of a partial layer of 
cells with good separation between the cells. For the most preferred embodiment of the invention, 
for example, 1 0" cells, each occupying about 2x10"^ mm^ and distributed uniformly on an area of 
50 mm^ result in a sample partial layer with an occupation of about 5 % of the surface of the 

26 substrate. Such a sample is suitable for analysis by the method of the invention, since the infrared 
absorption spectrum of each cell alone may be recorded. The samples reported by Zakim and 
Lord oftenhad sample populations 1000-10000 foldhigher. The high population creates non-linear 
absorption effects, retains the solvent in a manner that cannot be controlled, and may be 
responsible for many of the artifacts reported by them. For the method of the present invention, 

3 1 the spectrum of mostly single entities should be recorded. Overlapping a cancerous with a non 
cancerous cell, for example, would lead to a spectrum which is not definitive for either type of cell. 
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1 Cells are most preferably visualized by staining with fluorescent, monatomic dyes (for 

example, isotonic Eu^"^ ion solution). Such staining does not perturb the infrared spectra within the 
limits of detect ability, but permits the cells to be detected' using fluorescence excitation. 
Identification of cells is necessary to avoid data collection from cell debris. An image of a typical 
cell sample under fluorescence excitation in the most preferred embodiment of the invention is 

6 shown in Figure 6 which demonstrates the excellent separation of cells in this sample preparation 
procedure. 

The area of the substrate (ca. 50 mm^ ) that contains the cells is imaged using fluorescent 
and infrared wavelengths as follows. The substrate is inserted in the focal position 70 of a 
combination infrared microspectrometer (also known as an infrared microscope) and fluorescence 
1 1 microscope shovm schematically in figure 7. The infrared microspectrometer is equipped with 
a confocal, infrared-sensitive array detector 71 of 256 x 256 (=65536) individual diode elements 
on an area of about 7x7 mml Such instrumentation is commercially available. Fig. 7 shows is 
the optical arrangement for the infrared/fluorescent microscope combination which is the most 
preferred embodiment of the invention. Infrared light from the interferometer is passed tiirough a 

16 Cassegrain condenser 73 A into the sample, collected via an identical Cassegrain objective 73B, 
and focused with an infrared transmitting lens such as a ZnSe lens 77 onto the confocal array 
detector 71 . Reflecting optics could be used as well in place of the ZnSe lens. In order to reduce 
computation time for the analysis of up to 10,000 individual cells, only pixels that contain cellular 
spectral information is processed. This information is obtained by imaging the sample via 

21 fluorescence microscopy prior to infrared data acquisition. To this end, the cells are illuminated 
by introducing an ultraviolet beam of light 74 from a UV lamp 75 into the optical train using a 
movable mirror 78. The UV light induces intense fluorescence from the cells in tiie focal region 
70 that have been treated with Eu^^ ions. Since this stain consists of monatomic ions, it does not 
exhibit an infrared absorption spectrim; consequently, there is no spectral difference in the infrared 

26 spectrum due to stained and unstained cells. The fluorescent light collected in the reflection mode 
as shown in fig. 7 is filtered to remove the excitation wavelength by a movable dichroic mirror 75, 
and is processed via a CCD detector 76. This image taken by the fluorescence light will indicate, 
by the fluorescent intensities, the position of cells on the sample substrate (cf Figure 6). From 
these positions, one can identify which pixels of the confocal infrared array detector 71 need to 

3 1 be processed to obtain the desired infrared spectral information of the individual cells. Clearly, the 
UV fluorescent image may equally well be taken in transmission mode, and/or tiie spectra of the 
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1 cells may be taken (at less resolution) in reflection mode after reflection from an IR reflecting 

substrate holding the cells 

The analysis of the cells on the substrate proceeds as follows. The sample substrate is first 

illuminated with UV light that is absorbed by the Eu^^ stain with which the cells have been treated. 

The fluorescence induced by the UV excitation is observed in reflection mode of the microscope, 
6 and collected through selective bandpass fihers to reject the excitation wavelength. The 

fluorescence of the stained cells is detected by the CCD camera, and the positions of all cells on 

tiie substrate is computed by the positions of the pixels of the CCD camera detecting the 

fluorescent radiation. 

Subsequently, the illumination of the sample is switched to infrared radiation. The 
1 1 movable mirrors 75 and 78 are removed from the optical path of the microscope and the infrared 
light is passed through a step-scanning interferometer (not shown) to modulate the wavelength 
patterns of the light prior to being directed into the microscope. In the alternative, the movable 
mirror 78 and the dichroic mirror 75 could be formed on IR transparent substrates to reflect the 
appropriate light and transmit the infrared light. Each of the detector elements of the confocal 
16 infrared array detector 71 is exposed to the infixed radiation passing through the sample cells 
which are imaged 1:1 onto the detector resulting in about 1 cell or less per detector element. 

Interferograms are collected for all 65536 detector elements, and converted to infrared 
absorption spectra via a mathematical process known as Fourier transform (FT). Only the 
interferograms from detector pixels onto which cells are imaged are Fourier transformed to save 
2 1 data analysis time. These pixels are identified by the fluorescence picture obtained in the previous 
step. Selecting only picture elements that are known to contain spectral information reduces the 
number of individual spectra to be calculated from the interferograms from 65000 to between 
10,000 and 20,000. 

The resulting spectra are uniformly expanded, smoothed and corrected for water vapor 
26 absorption. Subsequently, the spectra (or their derivative spectra) are clustered using vector 
correlation methods. The clustering reveals the degree of "relatedness" of spectral patterns, and 
will reduce the number of independent observations to a few dozen spectral patterns for each 
sample. After this clustering, certain spectral fraces tiiat are clearly associated with cells of low 
mformation content are discarded. For example, spectra from red blood cells or fully mature, 
3 1 inactive cells may be discarded. The resultant spectral patterns, referred henceforth as the "reduced 
spectral set" is analyzed as follows. 
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1 First, the reduced spectral set is compared to the cell cycle dependent reference sets that 

have been collected beforehand for the organ sample under investigation. This analysis reveals 
whether or not the distribution of the cell cycle stages falls within normal limits. Subsequently, all 
spectra in the reduced data set are searched for the occurrence of spectral patterns associated with 
a fast growing and rapidly dividing cell. The occurrence of such cells in a sample is indicative of 

6 abnormality. 

The analysis described above requires knowledge of the cellular spectra, and cellular 
distribution, observed for normal samples. The analysis, however, depends to a lesser degree on 
bases (reference) sets than the one described by Zakim and Lord, which used spectral averages over 
large number of cells. It is clear that such an averaging process reduces the sensitivity of the 
1 1 method enormously, since only a small number of cells exhibit spectral traces modified by disease. 

The method presented here uses both a statistical analysis of the spectral patterns of the 
individual cells (/.e., changes in the distribution of cells at various stages of their development), 
as well as variations in the spectral patterns due to disease, for a vastly enhances sensitivity of the 
infrared spectral method. 

1 6 A preferred embodiment of the invention is sketched in Fig. 8. A specimen generator 80 

produces a plurality of specimen 82 in the same fashion as a FACS machine or an Inkjet printer. 
The specimen 82 may be droplets containing the entities to be measured, or may be cells or other 
biological entities. The specimen are produced and fly through an atmosphere with a defined 
velocity. The atmosphere may be ordinary laboratory air, or may be air with a controlled water 

2 1 vapor pressure or partial pressure of another vapor such as alcohol, or may be an inert gas such as 
nitrogen or argon, or vacuum. The specimen are generally electrically charged so that the specimen 
generator may accelerate them to a relatively high speed. However, such electrical charging is 
optional. The specimen are optionally marked with a fluorescent marker as detailed above for 
the marking of cells, and UV light source 8 1 and fluorescence detector 83 may be used to mark the 

26 position of a specimen 82, or indeed may be used to decide whether specimen 82 contains entities 
to be investigated. The position and speed of each specimen is now known. A vibrational 
spectrum recording apparatus is now used to record the spectrum of each specimen, and the 
resulting plurality of spectra is treated as detailed above to characterize the statistics of all the 
specimen. In the present embodiment, the preferred vibrational spectrum recording device is two 

3 1 infrared light sources 84 and 85 which direct infrared light on to a specimen 82C, and two detectors 
86 and 87 which measure the light transmitted through specimen 82C. Light sources 84 and 85 
are preferably pulsed laser sources which produce infrared light at frequencies characterizing the 
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1 infrared absorption spectrum of DNA and RNA respectively. A plurality of such lasers may be 
used to produce an entire vibrational spectrum for each specimen as detailed m the most preferred 
embodiment. A computer 88 is shown controlling specimen generator 80 and infrared sources 84 
and 85, and monitoring fluorescence detector 83. Lines indicatmg control and monitor fimctions 
for light source 81 and detectors 86 and 87 are not shown to avoid complexity in fig. 8. 
6 A preferred embodiment of the invention is shown in figure 9, where the separate infrared 

light sources 84 and 85 are replaced by a broad band infrared light source 94. The infrared light 
from the source 94 which is preferably a pulsed broad band laser, is focused on a specimen 82C 
which has been marked as noted above for figure 8. The broadband infrared light is then analyzed 
spectrally as indicated in fig. 9 by a focusing grating 96 which images light transmitted through the 

1 1 cell onto an array detector 98 according to the infrared wavelength. Rays of two different infrared 
wavelengths are traced in fig. 9 in order to guide the eye. The grating 96 could be replaced by an 
imaging system used to image the light from the specimen 82C on to the slit of a normal 
spectrometer and infrared recording array detector, or indeed by any other spectral recording device 
as known in the spectroscopic art. 

16 Lenses conventionally shown in fig. 8 and 9 are figurative only, and imaging and Ught 

handling devices as known in the art are anticipated by the inventors. 

The vibrational spectrum recording devices of the invention are not limited to the devices 
described above, but may be any vibrational spectiaim recording devices known in the 
specti-oscopic art to record IR absorption spectra or Raman spectra. 

1 1 Obviously, many modifications and variations of the present invention are possible in light 

of the above teachings. It is therefore to be understood that, withing the scope of the appended 
claims, the invention may be practiced otherwise then as specifically described. 
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1 We claim: 

1 1 . A method of characterizing a biological specimen, comprising: 

2 a) grouping a very large first plurality of entities into a second plurality of groups, each group 

3 comprising a small number of entities; 

4 b) characterizing each group of entities in the second plurality according to an aspect of the 

5 vibrational spectrum of each group; and 



6 c) statistically analyzing the characteristics of the groups of entities in the second plurality. 
1 2. The method of claim 1, wherein the small number is preponderantly one. 
1 3. The method of claim 1, wherein the entities are cells. 



1 4. The method of claim 1, wherein characterization of each group is the recording of infrared 

2 absorption spectra of the entities in each group. 

1 5. The method of claim 4, wherein the small number is preponderantly one, and wherein the 

2 entities are cells, and wherein the infrared absorption spectrum of each cell is analyzed for 

3 indications that the one cell in each group is in a cell division stage. 

1 6 . The method of claim 5 , wherein the results of the statistical analysis is the percentage of the cells 

2 in the cell division stage. 

1 7. The method of claim 5, wherein the indication that a cell is in a cell division stage is the 

2 presence of a signal indicating DNA in the infrared absorption spectra. 



1 

2 



8. The method of claim 4, wherein the small number is preponderantly one, and wherein entities 
are grouped according to the fluorescence of the entities in each group. 
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1 9. A microscope, comprising: 

2 infrared optics for imaging infrared light transmitted through a large number of entities on an area 

3 of a microscope stage on to a first detector, where the first detector is an infi-ared array 

4 detector; and 

5 optics for imaging fluorescence light emitted by the entities on to a second detector, where the 

6 second detector is a fluorescence light array detector. 

1 10. The microscope of claim 9, further comprising: 

2 a first source of infrared light, the infrared light for illuminating the area of the stage; 

3 an second source of ultraviolet light, the ultraviolet light for illuminating the area of the stage. 

1 11. The microscope of claim 1 0, wherein: 

2 the first detector is an infi^-red area array detector for detecting an image of the entities formed 

3 by the infrared fight transmitted through the entities; 

4 and the second detector is an area array detector for detecting an image of the entities formed by 

5 the fluorescence light emitted by the entities. 

1 12. The microscope of claim 1 1 , wherein the entities are single cells. 

13. The microscope of claim 12, wherein the infrared absorption spectra of each cell is recorded. 

14. The microscope of claim 13, wherein the infi-ared absorption spectiiim of each cell is analyzed 

for indications that the cell is in a cell division stage. 

15. The microscope of claim 14, wherein the percentage of the cells in the cell division stage is 

calculated. 
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16. The microscope of claim 14, wherein the indication that a cell is in a cell division stage is the 



2 presence of a signal indicating DNA in the infrared absorption spectra. 
1 1 7. An apparatus, comprising: 



2 location means for locating a very large number of cells; 



3 vibrational spectrum characterization means for characterizing the vibrational spectrum of each of 

4 the cells located by the location means. 

1 1 8. The apparatus of claim 1 7, wherein the vibrational spectrum characterization means comprises 

2 a means for generating and for transmitting infrared light through each cell. 

1 1 9. The apparatus of claim 1 8, wherein the means for generating infrared light comprises a first 

2 laser having a first defined infi-ared wavelength. 

1 20. The apparatus of claim 19, wherein the first laser is pulsed when the location means locates a 

2 first cell in a position to be characterized by the first laser. 

1 21. The apparatus of claim 19, wherein the first defined wavelength comprises a wavelength 

2 wherein DNA is highly absorbing. 

1 22. The apparatus of claim 21 , wherein a second laser having a second infi-ared wavelength is 

2 pulsed to characterize the first cell, wherein the second infi-ared wavelength comprises a 

3 wavelength wherein RNA is highly absorbing 



1 23. The apparatus of claim 20, wherein the first defined wavelength comprises a wavelength 

2 wherein DNA is highly absorbing. 

1 24. The apparatus of claim 1 8, wherein the means for generating infi-ared light comprises a third 

2 laser having a broad band infrared wavelength range. 
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25. The apparatus of claim 24, wheijein the third laser is pulsed when the location means locates 
a first cell in a position to be characterized by the laser. 

26. The apparatus of claim 25, wherein the broad band infrared wavelength range includes a 

wavelength wherein DNA is highly absorbing. 

27. The apparatus of claim 26, wherein the broad band infrared wavelength range includes a 

wavelength wherein RNA is highly absorbing. 

28. The apparatus of claim 27, wherein the infrared absorption spectrum of each cell is recorded. 

29. The apparatus of claim 28, wherein the infrared absorption spectrum of each cell is analyzed 

for indications that the cell is in a cell division stage. 

30. The apparatus of claim 29, wherein the percentage of the cells in the cell division stage is 

calculated. 

31. The apparatus of claim 30, wherein the indication that a cell is in a cell division stage is the 

presence of a signal indicating DNA in the infrared absorption spectra. 

33. The apparatus of claim 17, wherein the location means is a fluorescence activated sorting 

apparatus 

34. A method of characterizing a large group of biological cells, comprising: 

a) separating the cells so that the cells of the large group are preponderantly separated from each 

other; 

b) characterizing each cell according to an aspect of the vibrational spectrum each cell; and 

c) statistically analyzing the characteristics of the groups cells. 
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35. The method of claim 34, wherem the vibrational spectrum of each cell is the recording of an 
infrared absorption spectrum for each cell. 



36. The method of claim 35, wherein the infrared absorption spectrum of each cell is analyzed for 

indications that the cell is in a cell division stage. 

37. The method of claun 36, wherein the results of the statistical analysis is the percentage of the 

cells of the group which are in a cell division st^e. 

38. The method of claim 37, wherein the indication that a cell is in a cell division stage is the 

presence of a signal indicating DNA in the infrared absorption spectra. 

39. The method of claim 38, wherein the separated cells are located according to the fluorescence 

of the cells. 



1 
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Method of Characterization of Biological Entities 



2 ABSTRACT OF THE INVENTION 

3 An apparatus and method are disclosed for measuring the infrared vibrational spectral 

4 characteristics of each of a large number of biological entities such as cells, and from the 

5 measurements statistically determining the presence of anomalies such as cancer. 
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